Overview
Dataset statistics
| Number of variables | 14 |
|---|---|
| Number of observations | 180000 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 75.3 MiB |
| Average record size in memory | 438.8 B |
Variable types
| Numeric | 8 |
|---|---|
| Categorical | 5 |
| Boolean | 1 |
cb_person_cred_hist_length is highly overall correlated with person_age and 1 other fields | High correlation |
loan_amnt is highly overall correlated with loan_percent_income | High correlation |
loan_percent_income is highly overall correlated with loan_amnt | High correlation |
loan_status is highly overall correlated with previous_loan_defaults_on_file | High correlation |
person_age is highly overall correlated with cb_person_cred_hist_length and 1 other fields | High correlation |
person_emp_exp is highly overall correlated with cb_person_cred_hist_length and 1 other fields | High correlation |
previous_loan_defaults_on_file is highly overall correlated with loan_status | High correlation |
person_income is highly skewed (γ₁ = 34.13672968) | Skewed |
person_id is uniformly distributed | Uniform |
person_id has unique values | Unique |
person_emp_exp has 38264 (21.3%) zeros | Zeros |
Reproduction
| Analysis started | 2024-12-20 03:21:24.127755 |
|---|---|
| Analysis finished | 2024-12-20 03:21:57.225356 |
| Duration | 33.1 seconds |
| Software version | ydata-profiling v4.12.1 |
Variables
person_id
Real number (ℝ)
Uniform  Unique 
| Distinct | 180000 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 90000.5 |
| Minimum | 1 |
|---|---|
| Maximum | 180000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 9000.95 |
| Q1 | 45000.75 |
| median | 90000.5 |
| Q3 | 135000.25 |
| 95-th percentile | 171000.05 |
| Maximum | 180000 |
| Range | 179999 |
| Interquartile range (IQR) | 89999.5 |
Descriptive statistics
| Standard deviation | 51961.669 |
|---|---|
| Coefficient of variation (CV) | 0.57734867 |
| Kurtosis | -1.2 |
| Mean | 90000.5 |
| Median Absolute Deviation (MAD) | 45000 |
| Skewness | 0 |
| Sum | 1.620009 × 10¹⁰ |
| Variance | 2.700015 × 10⁹ |
| Monotonicity | Strictly increasing |
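The person_id statistics above are what a plain row counter running from 1 to 180000 would produce, which suggests the column is an identifier rather than a feature. The following is a minimal sketch that reproduces them with the standard library. It assumes a sample variance (n − 1 denominator) and linear-interpolation quantiles; both are assumptions about how the profiler computes these values, chosen because they match the reported figures.

```python
import math

n = 180_000
ids = range(1, n + 1)  # assumption: person_id is simply 1..n

total = sum(ids)
mean = total / n
# Sample variance with an n - 1 denominator (assumption).
variance = math.fsum((x - mean) ** 2 for x in ids) / (n - 1)
std = math.sqrt(variance)
cv = std / mean  # coefficient of variation


def quantile(q: float) -> float:
    """Linear-interpolation quantile of the sorted ids (assumption)."""
    pos = q * (n - 1)
    lo = math.floor(pos)
    hi = min(lo + 1, n - 1)
    return ids[lo] + (pos - lo) * (ids[hi] - ids[lo])


print("sum:", total, "mean:", mean)
print("variance:", round(variance), "std:", round(std, 3), "cv:", round(cv, 8))
print("Q1:", quantile(0.25), "Q3:", quantile(0.75))
```

If the printed values agree with the tables above, the column carries no information beyond row order and can usually be dropped before modelling.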
| Value | Count | Frequency (%) |
| 1 | 1 | < 0.1% |
| 120004 | 1 | < 0.1% |
| 119996 | 1 | < 0.1% |
| 119997 | 1 | < 0.1% |
| 119998 | 1 | < 0.1% |
| 119999 | 1 | < 0.1% |
| 120000 | 1 | < 0.1% |
| 120001 | 1 | < 0.1% |
| 120002 | 1 | < 0.1% |
| 120003 | 1 | < 0.1% |
| Other values (179990) | 179990 |
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 | |
| 10 | 1 |
| Value | Count | Frequency (%) |
| 180000 | 1 | |
| 179999 | 1 | |
| 179998 | 1 | |
| 179997 | 1 | |
| 179996 | 1 | |
| 179995 | 1 | |
| 179994 | 1 | |
| 179993 | 1 | |
| 179992 | 1 | |
| 179991 | 1 |
person_age
Real number (ℝ)
High correlation 
| Distinct | 60 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 27.764178 |
| Minimum | 20 |
|---|---|
| Maximum | 144 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 MiB |
Quantile statistics
| Minimum | 20 |
|---|---|
| 5-th percentile | 22 |
| Q1 | 24 |
| median | 26 |
| Q3 | 30 |
| 95-th percentile | 39 |
| Maximum | 144 |
| Range | 124 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 6.0450578 |
|---|---|
| Coefficient of variation (CV) | 0.21772868 |
| Kurtosis | 18.647795 |
| Mean | 27.764178 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 2.5480903 |
| Sum | 4997552 |
| Variance | 36.542724 |
| Monotonicity | Not monotonic |
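The maximum of 144 sits far above the 95th percentile of 39, so the upper tail of person_age deserves a closer look. One common way to flag such values is Tukey's 1.5 × IQR rule. The report does not apply this rule itself; it is used here only as an illustration, with the quartiles copied from the quantile table above and a hypothetical plausibility cutoff of 100 years.

```python
# Quartiles copied from the person_age quantile table.
q1, q3 = 24, 30
iqr = q3 - q1

# Tukey's rule: values more than 1.5 * IQR outside the quartiles are suspect.
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr

# The ten largest ages listed in the report's extreme-value table.
largest_ages = [144, 123, 116, 109, 94, 84, 80, 78, 76, 73]

beyond_fence = [age for age in largest_ages if age > upper_fence]
PLAUSIBLE_MAX_AGE = 100  # hypothetical cutoff, not taken from the report
implausible = [age for age in largest_ages if age > PLAUSIBLE_MAX_AGE]

print("fences:", lower_fence, upper_fence)
print("beyond upper fence:", beyond_fence)
print("implausible:", implausible)
```

The fence is a statistical flag rather than a validity check: an age of 73 is above the fence yet perfectly possible, while the values above the hypothetical cutoff are more likely data-entry errors.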
| Value | Count | Frequency (%) |
| 23 | 21016 | |
| 24 | 20552 | |
| 25 | 18028 | |
| 22 | 16944 | |
| 26 | 14636 | 8.1% |
| 27 | 12380 | 6.9% |
| 28 | 10912 | 6.1% |
| 29 | 9820 | 5.5% |
| 30 | 8084 | 4.5% |
| 31 | 6580 | 3.7% |
| Other values (50) | 41048 |
| Value | Count | Frequency (%) |
| 20 | 68 | < 0.1% |
| 21 | 5156 | 2.9% |
| 22 | 16944 | |
| 23 | 21016 | |
| 24 | 20552 | |
| 25 | 18028 | |
| 26 | 14636 | |
| 27 | 12380 | |
| 28 | 10912 | |
| 29 | 9820 |
| Value | Count | Frequency (%) |
| 144 | 12 | |
| 123 | 8 | |
| 116 | 4 | < 0.1% |
| 109 | 4 | < 0.1% |
| 94 | 4 | < 0.1% |
| 84 | 4 | < 0.1% |
| 80 | 4 | < 0.1% |
| 78 | 4 | < 0.1% |
| 76 | 4 | < 0.1% |
| 73 | 12 |
person_gender
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 10.6 MiB |
| male | |
|---|---|
| female |
Length
| Max length | 6 |
|---|---|
| Median length | 4 |
| Mean length | 4.8959556 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | female |
|---|---|
| 2nd row | female |
| 3rd row | female |
| 4th row | female |
| 5th row | male |
Common Values
| Value | Count | Frequency (%) |
| male | 99364 | |
| female | 80636 |
Common Values (Plot)
| Value | Count | Frequency (%) |
| male | 99364 | |
| female | 80636 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 260636 | |
| m | 180000 | |
| a | 180000 | |
| l | 180000 | |
| f | 80636 | 9.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 881272 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 260636 | |
| m | 180000 | |
| a | 180000 | |
| l | 180000 | |
| f | 80636 | 9.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 881272 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 260636 | |
| m | 180000 | |
| a | 180000 | |
| l | 180000 | |
| f | 80636 | 9.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 881272 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 260636 | |
| m | 180000 | |
| a | 180000 | |
| l | 180000 | |
| f | 80636 | 9.1% |
person_education
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 11.3 MiB |
| Bachelor | |
|---|---|
| Associate | |
| High School | |
| Master | |
| Doctorate | 2484 |
Length
| Max length | 11 |
|---|---|
| Median length | 9 |
| Mean length | 8.769 |
| Min length | 6 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Master |
|---|---|
| 2nd row | High School |
| 3rd row | High School |
| 4th row | Bachelor |
| 5th row | Master |
Common Values
| Value | Count | Frequency (%) |
| Bachelor | 53596 | |
| Associate | 48112 | |
| High School | 47888 | |
| Master | 27920 | |
| Doctorate | 2484 | 1.4% |
Common Values (Plot)
| Value | Count | Frequency (%) |
| bachelor | 53596 | |
| associate | 48112 | |
| high | 47888 | |
| school | 47888 | |
| master | 27920 | |
| doctorate | 2484 | 1.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 202452 | |
| c | 152080 | |
| h | 149372 | |
| e | 132112 | |
| a | 132112 | |
| s | 124144 | 7.9% |
| l | 101484 | 6.4% |
| i | 96000 | 6.1% |
| r | 84000 | 5.3% |
| t | 81000 | 5.1% |
| Other values (8) | 323664 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1578420 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| o | 202452 | |
| c | 152080 | |
| h | 149372 | |
| e | 132112 | |
| a | 132112 | |
| s | 124144 | 7.9% |
| l | 101484 | 6.4% |
| i | 96000 | 6.1% |
| r | 84000 | 5.3% |
| t | 81000 | 5.1% |
| Other values (8) | 323664 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1578420 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| o | 202452 | |
| c | 152080 | |
| h | 149372 | |
| e | 132112 | |
| a | 132112 | |
| s | 124144 | 7.9% |
| l | 101484 | 6.4% |
| i | 96000 | 6.1% |
| r | 84000 | 5.3% |
| t | 81000 | 5.1% |
| Other values (8) | 323664 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1578420 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| o | 202452 | |
| c | 152080 | |
| h | 149372 | |
| e | 132112 | |
| a | 132112 | |
| s | 124144 | 7.9% |
| l | 101484 | 6.4% |
| i | 96000 | 6.1% |
| r | 84000 | 5.3% |
| t | 81000 | 5.1% |
| Other values (8) | 323664 |
person_income
Real number (ℝ)
Skewed 
| Distinct | 33989 |
|---|---|
| Distinct (%) | 18.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 80319.053 |
| Minimum | 8000 |
|---|---|
| Maximum | 7200766 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 MiB |
Quantile statistics
| Minimum | 8000 |
|---|---|
| 5-th percentile | 28366.7 |
| Q1 | 47204 |
| median | 67048 |
| Q3 | 95789.25 |
| 95-th percentile | 166754.7 |
| Maximum | 7200766 |
| Range | 7192766 |
| Interquartile range (IQR) | 48585.25 |
Descriptive statistics
| Standard deviation | 80421.828 |
|---|---|
| Coefficient of variation (CV) | 1.0012796 |
| Kurtosis | 2398.4848 |
| Mean | 80319.053 |
| Median Absolute Deviation (MAD) | 23124 |
| Skewness | 34.13673 |
| Sum | 1.445743 × 10¹⁰ |
| Variance | 6.4676705 × 10⁹ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 8000 | 60 | < 0.1% |
| 73011 | 40 | < 0.1% |
| 36995 | 36 | < 0.1% |
| 60914 | 32 | < 0.1% |
| 37020 | 32 | < 0.1% |
| 73082 | 28 | < 0.1% |
| 60864 | 28 | < 0.1% |
| 67131 | 28 | < 0.1% |
| 72951 | 28 | < 0.1% |
| 73040 | 28 | < 0.1% |
| Other values (33979) | 179660 |
| Value | Count | Frequency (%) |
| 8000 | 60 | |
| 8037 | 4 | < 0.1% |
| 8104 | 4 | < 0.1% |
| 8186 | 4 | < 0.1% |
| 8248 | 4 | < 0.1% |
| 8267 | 4 | < 0.1% |
| 8277 | 4 | < 0.1% |
| 8302 | 4 | < 0.1% |
| 8518 | 4 | < 0.1% |
| 9364 | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 7200766 | 4 | |
| 5556399 | 4 | |
| 5545545 | 4 | |
| 2448661 | 4 | |
| 2280980 | 4 | |
| 2139143 | 4 | |
| 2012954 | 4 | |
| 1741243 | 4 | |
| 1728974 | 4 | |
| 1661567 | 4 |
person_emp_exp
Real number (ℝ)
High correlation  Zeros 
| Distinct | 63 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.4103333 |
| Minimum | 0 |
|---|---|
| Maximum | 125 |
| Zeros | 38264 |
| Zeros (%) | 21.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 4 |
| Q3 | 8 |
| 95-th percentile | 17 |
| Maximum | 125 |
| Range | 125 |
| Interquartile range (IQR) | 7 |
Descriptive statistics
| Standard deviation | 6.0634816 |
|---|---|
| Coefficient of variation (CV) | 1.1207224 |
| Kurtosis | 19.166626 |
| Mean | 5.4103333 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 2.5948525 |
| Sum | 973860 |
| Variance | 36.765809 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 38264 | |
| 2 | 16536 | |
| 1 | 16244 | |
| 3 | 15560 | |
| 4 | 14096 | 7.8% |
| 5 | 12000 | 6.7% |
| 6 | 10868 | 6.0% |
| 7 | 8816 | 4.9% |
| 8 | 7560 | 4.2% |
| 9 | 6300 | 3.5% |
| Other values (53) | 33756 |
| Value | Count | Frequency (%) |
| 0 | 38264 | |
| 1 | 16244 | |
| 2 | 16536 | |
| 3 | 15560 | |
| 4 | 14096 | 7.8% |
| 5 | 12000 | 6.7% |
| 6 | 10868 | 6.0% |
| 7 | 8816 | 4.9% |
| 8 | 7560 | 4.2% |
| 9 | 6300 | 3.5% |
| Value | Count | Frequency (%) |
| 125 | 4 | |
| 124 | 4 | |
| 121 | 4 | |
| 101 | 4 | |
| 100 | 4 | |
| 93 | 4 | |
| 85 | 4 | |
| 76 | 4 | |
| 62 | 4 | |
| 61 | 4 |
person_home_ownership
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 10.7 MiB |
| RENT | |
|---|---|
| MORTGAGE | |
| OWN | |
| OTHER | 468 |
Length
| Max length | 8 |
|---|---|
| Median length | 4 |
| Mean length | 5.5804889 |
| Min length | 3 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | RENT |
|---|---|
| 2nd row | OWN |
| 3rd row | MORTGAGE |
| 4th row | RENT |
| 5th row | RENT |
Common Values
| Value | Count | Frequency (%) |
| RENT | 93772 | |
| MORTGAGE | 73956 | |
| OWN | 11804 | 6.6% |
| OTHER | 468 | 0.3% |
Common Values (Plot)
| Value | Count | Frequency (%) |
| rent | 93772 | |
| mortgage | 73956 | |
| own | 11804 | 6.6% |
| other | 468 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| R | 168196 | |
| E | 168196 | |
| T | 168196 | |
| G | 147912 | |
| N | 105576 | |
| O | 86228 | |
| M | 73956 | |
| A | 73956 | |
| W | 11804 | 1.2% |
| H | 468 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1004488 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| R | 168196 | |
| E | 168196 | |
| T | 168196 | |
| G | 147912 | |
| N | 105576 | |
| O | 86228 | |
| M | 73956 | |
| A | 73956 | |
| W | 11804 | 1.2% |
| H | 468 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1004488 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| R | 168196 | |
| E | 168196 | |
| T | 168196 | |
| G | 147912 | |
| N | 105576 | |
| O | 86228 | |
| M | 73956 | |
| A | 73956 | |
| W | 11804 | 1.2% |
| H | 468 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1004488 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| R | 168196 | |
| E | 168196 | |
| T | 168196 | |
| G | 147912 | |
| N | 105576 | |
| O | 86228 | |
| M | 73956 | |
| A | 73956 | |
| W | 11804 | 1.2% |
| H | 468 | < 0.1% |
cb_person_cred_hist_length
Real number (ℝ)
High correlation 
| Distinct | 29 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.8674889 |
| Minimum | 2 |
|---|---|
| Maximum | 30 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 MiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 3 |
| median | 4 |
| Q3 | 8 |
| 95-th percentile | 14 |
| Maximum | 30 |
| Range | 28 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 3.8796695 |
|---|---|
| Coefficient of variation (CV) | 0.66121463 |
| Kurtosis | 3.725534 |
| Mean | 5.8674889 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 1.6316792 |
| Sum | 1056148 |
| Variance | 15.051836 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4 | 34612 | |
| 3 | 33248 | |
| 2 | 26148 | |
| 5 | 12328 | 6.8% |
| 6 | 11864 | 6.6% |
| 7 | 11556 | 6.4% |
| 8 | 11200 | 6.2% |
| 9 | 10740 | 6.0% |
| 10 | 9828 | 5.5% |
| 12 | 2860 | 1.6% |
| Other values (19) | 15616 |
| Value | Count | Frequency (%) |
| 2 | 26148 | |
| 3 | 33248 | |
| 4 | 34612 | |
| 5 | 12328 | 6.8% |
| 6 | 11864 | 6.6% |
| 7 | 11556 | 6.4% |
| 8 | 11200 | 6.2% |
| 9 | 10740 | 6.0% |
| 10 | 9828 | 5.5% |
| 11 | 2848 | 1.6% |
| Value | Count | Frequency (%) |
| 30 | 92 | |
| 29 | 60 | |
| 28 | 116 | |
| 27 | 92 | |
| 26 | 80 | |
| 25 | 92 | |
| 24 | 136 | |
| 23 | 104 | |
| 22 | 128 | |
| 21 | 96 |
loan_amnt
Real number (ℝ)
High correlation 
| Distinct | 4483 |
|---|---|
| Distinct (%) | 2.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9583.1576 |
| Minimum | 500 |
|---|---|
| Maximum | 35000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 MiB |
Quantile statistics
| Minimum | 500 |
|---|---|
| 5-th percentile | 2000 |
| Q1 | 5000 |
| median | 8000 |
| Q3 | 12237.25 |
| 95-th percentile | 24000 |
| Maximum | 35000 |
| Range | 34500 |
| Interquartile range (IQR) | 7237.25 |
Descriptive statistics
| Standard deviation | 6314.8341 |
|---|---|
| Coefficient of variation (CV) | 0.65895129 |
| Kurtosis | 1.3510026 |
| Mean | 9583.1576 |
| Median Absolute Deviation (MAD) | 3800 |
| Skewness | 1.1797018 |
| Sum | 1.7249684 × 10⁹ |
| Variance | 39877129 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10000 | 14468 | 8.0% |
| 5000 | 11148 | 6.2% |
| 6000 | 9704 | 5.4% |
| 12000 | 9664 | 5.4% |
| 15000 | 8016 | 4.5% |
| 8000 | 7712 | 4.3% |
| 4000 | 5624 | 3.1% |
| 20000 | 5540 | 3.1% |
| 3000 | 5512 | 3.1% |
| 7000 | 5256 | 2.9% |
| Other values (4473) | 97356 |
| Value | Count | Frequency (%) |
| 500 | 20 | |
| 563 | 4 | < 0.1% |
| 700 | 4 | < 0.1% |
| 725 | 4 | < 0.1% |
| 750 | 4 | < 0.1% |
| 800 | 4 | < 0.1% |
| 900 | 8 | < 0.1% |
| 912 | 4 | < 0.1% |
| 922 | 4 | < 0.1% |
| 950 | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 35000 | 936 | |
| 34826 | 4 | < 0.1% |
| 34800 | 4 | < 0.1% |
| 34664 | 4 | < 0.1% |
| 34375 | 4 | < 0.1% |
| 34322 | 4 | < 0.1% |
| 34121 | 4 | < 0.1% |
| 34000 | 16 | < 0.1% |
| 33950 | 8 | < 0.1% |
| 33800 | 4 | < 0.1% |
loan_intent
Categorical
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 11.5 MiB |
| EDUCATION | |
|---|---|
| MEDICAL | |
| VENTURE | |
| PERSONAL | |
| DEBTCONSOLIDATION |
Length
| Max length | 17 |
|---|---|
| Median length | 15 |
| Mean length | 10.012711 |
| Min length | 7 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PERSONAL |
|---|---|
| 2nd row | EDUCATION |
| 3rd row | MEDICAL |
| 4th row | MEDICAL |
| 5th row | MEDICAL |
Common Values
| Value | Count | Frequency (%) |
| EDUCATION | 36612 | |
| MEDICAL | 34192 | |
| VENTURE | 31276 | |
| PERSONAL | 30208 | |
| DEBTCONSOLIDATION | 28580 | |
| HOMEIMPROVEMENT | 19132 |
Common Values (Plot)
| Value | Count | Frequency (%) |
| education | 36612 | |
| medical | 34192 | |
| venture | 31276 | |
| personal | 30208 | |
| debtconsolidation | 28580 | |
| homeimprovement | 19132 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 249540 | |
| O | 190824 | |
| N | 174388 | |
| I | 147096 | |
| T | 144180 | |
| A | 129592 | 7.2% |
| D | 127964 | 7.1% |
| C | 99384 | 5.5% |
| L | 92980 | 5.2% |
| M | 91588 | 5.1% |
| Other values (7) | 354752 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1802288 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| E | 249540 | |
| O | 190824 | |
| N | 174388 | |
| I | 147096 | |
| T | 144180 | |
| A | 129592 | 7.2% |
| D | 127964 | 7.1% |
| C | 99384 | 5.5% |
| L | 92980 | 5.2% |
| M | 91588 | 5.1% |
| Other values (7) | 354752 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1802288 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| E | 249540 | |
| O | 190824 | |
| N | 174388 | |
| I | 147096 | |
| T | 144180 | |
| A | 129592 | 7.2% |
| D | 127964 | 7.1% |
| C | 99384 | 5.5% |
| L | 92980 | 5.2% |
| M | 91588 | 5.1% |
| Other values (7) | 354752 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1802288 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| E | 249540 | |
| O | 190824 | |
| N | 174388 | |
| I | 147096 | |
| T | 144180 | |
| A | 129592 | 7.2% |
| D | 127964 | 7.1% |
| C | 99384 | 5.5% |
| L | 92980 | 5.2% |
| M | 91588 | 5.1% |
| Other values (7) | 354752 |
loan_int_rate
Real number (ℝ)
| Distinct | 1302 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11.006606 |
| Minimum | 5.42 |
|---|---|
| Maximum | 20 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 MiB |
Quantile statistics
| Minimum | 5.42 |
|---|---|
| 5-th percentile | 6.17 |
| Q1 | 8.59 |
| median | 11.01 |
| Q3 | 12.99 |
| 95-th percentile | 16 |
| Maximum | 20 |
| Range | 14.58 |
| Interquartile range (IQR) | 4.4 |
Descriptive statistics
| Standard deviation | 2.9787835 |
|---|---|
| Coefficient of variation (CV) | 0.27063597 |
| Kurtosis | -0.42040028 |
| Mean | 11.006606 |
| Median Absolute Deviation (MAD) | 2.13 |
| Skewness | 0.21377873 |
| Sum | 1981189 |
| Variance | 8.8731509 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 11.01 | 13316 | 7.4% |
| 10.99 | 3216 | 1.8% |
| 7.51 | 3192 | 1.8% |
| 7.49 | 2748 | 1.5% |
| 7.88 | 2692 | 1.5% |
| 5.42 | 2432 | 1.4% |
| 7.9 | 2424 | 1.3% |
| 11.49 | 2056 | 1.1% |
| 9.99 | 1936 | 1.1% |
| 13.49 | 1900 | 1.1% |
| Other values (1292) | 144088 |
| Value | Count | Frequency (%) |
| 5.42 | 2432 | |
| 5.43 | 8 | < 0.1% |
| 5.44 | 8 | < 0.1% |
| 5.46 | 4 | < 0.1% |
| 5.47 | 20 | < 0.1% |
| 5.48 | 16 | < 0.1% |
| 5.49 | 16 | < 0.1% |
| 5.5 | 4 | < 0.1% |
| 5.51 | 12 | < 0.1% |
| 5.52 | 8 | < 0.1% |
| Value | Count | Frequency (%) |
| 20 | 336 | |
| 19.91 | 36 | < 0.1% |
| 19.9 | 4 | < 0.1% |
| 19.82 | 20 | < 0.1% |
| 19.8 | 4 | < 0.1% |
| 19.79 | 16 | < 0.1% |
| 19.74 | 16 | < 0.1% |
| 19.69 | 48 | < 0.1% |
| 19.66 | 12 | < 0.1% |
| 19.62 | 4 | < 0.1% |
loan_percent_income
Real number (ℝ)
High correlation 
| Distinct | 64 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.13972489 |
| Minimum | 0 |
|---|---|
| Maximum | 0.66 |
| Zeros | 108 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.03 |
| Q1 | 0.07 |
| median | 0.12 |
| Q3 | 0.19 |
| 95-th percentile | 0.31 |
| Maximum | 0.66 |
| Range | 0.66 |
| Interquartile range (IQR) | 0.12 |
Descriptive statistics
| Standard deviation | 0.087211581 |
|---|---|
| Coefficient of variation (CV) | 0.6241664 |
| Kurtosis | 1.082226 |
| Mean | 0.13972489 |
| Median Absolute Deviation (MAD) | 0.05 |
| Skewness | 1.0344863 |
| Sum | 25150.48 |
| Variance | 0.0076058599 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.08 | 10372 | 5.8% |
| 0.1 | 9684 | 5.4% |
| 0.07 | 9660 | 5.4% |
| 0.09 | 9180 | 5.1% |
| 0.06 | 8968 | 5.0% |
| 0.12 | 8864 | 4.9% |
| 0.05 | 8704 | 4.8% |
| 0.11 | 8632 | 4.8% |
| 0.14 | 7840 | 4.4% |
| 0.04 | 7800 | 4.3% |
| Other values (54) | 90296 |
| Value | Count | Frequency (%) |
| 0 | 108 | 0.1% |
| 0.01 | 1260 | 0.7% |
| 0.02 | 3776 | 2.1% |
| 0.03 | 5952 | |
| 0.04 | 7800 | |
| 0.05 | 8704 | |
| 0.06 | 8968 | |
| 0.07 | 9660 | |
| 0.08 | 10372 | |
| 0.09 | 9180 |
| Value | Count | Frequency (%) |
| 0.66 | 4 | < 0.1% |
| 0.63 | 4 | < 0.1% |
| 0.62 | 8 | < 0.1% |
| 0.61 | 8 | < 0.1% |
| 0.59 | 4 | < 0.1% |
| 0.58 | 4 | < 0.1% |
| 0.57 | 4 | < 0.1% |
| 0.56 | 20 | |
| 0.55 | 20 | |
| 0.54 | 32 |
loan_status
Categorical
High correlation 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 10.0 MiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 0 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 140000 | |
| 1 | 40000 | 22.2% |
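The frequency column can be recomputed from the counts, which is useful where the extracted table left a cell empty. A minimal sketch using the two loan_status counts above (the variable names are illustrative):

```python
# Counts copied from the loan_status common-values table.
counts = {"0": 140_000, "1": 40_000}
total = sum(counts.values())

# Share of each class as a percentage, rounded to one decimal like the report.
shares = {label: round(100 * count / total, 1) for label, count in counts.items()}

# Ratio of the majority class to the minority class.
imbalance_ratio = counts["0"] / counts["1"]

print(shares)
print("majority : minority =", imbalance_ratio)
```

With roughly one positive for every 3.5 negatives, the target is moderately imbalanced, which is worth keeping in mind when choosing evaluation metrics.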
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 140000 | |
| 1 | 40000 | 22.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 140000 | |
| 1 | 40000 | 22.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 180000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 140000 | |
| 1 | 40000 | 22.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 180000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 140000 | |
| 1 | 40000 | 22.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 180000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 140000 | |
| 1 | 40000 | 22.2% |
previous_loan_defaults_on_file
Boolean
High correlation 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 175.9 KiB |
| True | |
|---|---|
| False |
| Value | Count | Frequency (%) |
| True | 91432 | |
| False | 88568 |
Interactions
Correlations
| | cb_person_cred_hist_length | loan_amnt | loan_int_rate | loan_intent | loan_percent_income | loan_status | person_age | person_education | person_emp_exp | person_gender | person_home_ownership | person_id | person_income | previous_loan_defaults_on_file |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| cb_person_cred_hist_length | 1.000 | 0.043 | 0.017 | 0.055 | -0.037 | 0.024 | 0.821 | 0.092 | 0.750 | 0.029 | 0.030 | 0.128 | 0.093 | 0.029 |
| loan_amnt | 0.043 | 1.000 | 0.105 | 0.033 | 0.666 | 0.126 | 0.064 | 0.012 | 0.052 | 0.013 | 0.091 | 0.017 | 0.405 | 0.068 |
| loan_int_rate | 0.017 | 0.105 | 1.000 | 0.021 | 0.124 | 0.363 | 0.013 | 0.013 | 0.016 | 0.008 | 0.085 | 0.005 | -0.033 | 0.198 |
| loan_intent | 0.055 | 0.033 | 0.021 | 1.000 | 0.022 | 0.142 | 0.032 | 0.015 | 0.031 | 0.005 | 0.083 | 0.032 | 0.013 | 0.081 |
| loan_percent_income | -0.037 | 0.666 | 0.124 | 0.022 | 1.000 | 0.415 | -0.056 | 0.011 | -0.050 | 0.009 | 0.092 | -0.002 | -0.353 | 0.220 |
| loan_status | 0.024 | 0.126 | 0.363 | 0.142 | 0.415 | 1.000 | 0.017 | 0.005 | 0.018 | 0.000 | 0.258 | 0.093 | 0.013 | 0.543 |
| person_age | 0.821 | 0.064 | 0.013 | 0.032 | -0.056 | 0.017 | 1.000 | 0.061 | 0.888 | 0.026 | 0.019 | 0.122 | 0.143 | 0.032 |
| person_education | 0.092 | 0.012 | 0.013 | 0.015 | 0.011 | 0.005 | 0.061 | 1.000 | 0.066 | 0.002 | 0.010 | 0.041 | 0.010 | 0.041 |
| person_emp_exp | 0.750 | 0.052 | 0.016 | 0.031 | -0.050 | 0.018 | 0.888 | 0.066 | 1.000 | 0.024 | 0.015 | 0.101 | 0.120 | 0.031 |
| person_gender | 0.029 | 0.013 | 0.008 | 0.005 | 0.009 | 0.000 | 0.026 | 0.002 | 0.024 | 1.000 | 0.000 | 0.000 | 0.013 | 0.000 |
| person_home_ownership | 0.030 | 0.091 | 0.085 | 0.083 | 0.092 | 0.258 | 0.019 | 0.010 | 0.015 | 0.000 | 1.000 | 0.061 | 0.012 | 0.140 |
| person_id | 0.128 | 0.017 | 0.005 | 0.032 | -0.002 | 0.093 | 0.122 | 0.041 | 0.101 | 0.000 | 0.061 | 1.000 | 0.025 | 0.037 |
| person_income | 0.093 | 0.405 | -0.033 | 0.013 | -0.353 | 0.013 | 0.143 | 0.010 | 0.120 | 0.013 | 0.012 | 0.025 | 1.000 | 0.012 |
| previous_loan_defaults_on_file | 0.029 | 0.068 | 0.198 | 0.081 | 0.220 | 0.543 | 0.032 | 0.041 | 0.031 | 0.000 | 0.140 | 0.037 | 0.012 | 1.000 |
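The "High correlation" alerts in the overview can be related back to this matrix. The sketch below scans a hand-copied subset of the coefficients and lists, for each field, the partners whose absolute coefficient reaches a cutoff. The cutoff of 0.5 is an assumption: the report does not state it, but it reproduces the seven alerts listed in the overview. The function name is illustrative.

```python
# A subset of the upper triangle of the matrix above: (field_a, field_b, coefficient).
pairs = [
    ("cb_person_cred_hist_length", "person_age", 0.821),
    ("cb_person_cred_hist_length", "person_emp_exp", 0.750),
    ("person_age", "person_emp_exp", 0.888),
    ("loan_amnt", "loan_percent_income", 0.666),
    ("loan_status", "previous_loan_defaults_on_file", 0.543),
    ("loan_status", "loan_percent_income", 0.415),
    ("loan_amnt", "person_income", 0.405),
    ("loan_int_rate", "loan_status", 0.363),
    ("loan_percent_income", "person_income", -0.353),
]

THRESHOLD = 0.5  # assumed cutoff, not stated in the report


def high_correlation_partners(pairs, threshold):
    """Map each field to the fields it is strongly correlated with."""
    partners = {}
    for field_a, field_b, coefficient in pairs:
        if abs(coefficient) >= threshold:
            partners.setdefault(field_a, []).append(field_b)
            partners.setdefault(field_b, []).append(field_a)
    return partners


flagged = high_correlation_partners(pairs, THRESHOLD)
for field in sorted(flagged):
    print(field, "->", sorted(flagged[field]))
```

Under this cutoff, person_age, person_emp_exp and cb_person_cred_hist_length form one mutually correlated group, so a model may need only one or two of them.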
Missing values
Sample
| | person_id | person_age | person_gender | person_education | person_income | person_emp_exp | person_home_ownership | cb_person_cred_hist_length | loan_amnt | loan_intent | loan_int_rate | loan_percent_income | loan_status | previous_loan_defaults_on_file |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | 22 | female | Master | 71948.0 | 0.0 | RENT | 3.0 | 35000.0 | PERSONAL | 16.02 | 0.49 | 1 | No |
| 1 | 2 | 21 | female | High School | 12282.0 | 0.0 | OWN | 2.0 | 1000.0 | EDUCATION | 11.14 | 0.08 | 0 | Yes |
| 2 | 3 | 25 | female | High School | 12438.0 | 3.0 | MORTGAGE | 3.0 | 5500.0 | MEDICAL | 12.87 | 0.44 | 1 | No |
| 3 | 4 | 23 | female | Bachelor | 79753.0 | 0.0 | RENT | 2.0 | 35000.0 | MEDICAL | 15.23 | 0.44 | 1 | No |
| 4 | 5 | 24 | male | Master | 66135.0 | 1.0 | RENT | 4.0 | 35000.0 | MEDICAL | 14.27 | 0.53 | 1 | No |
| 5 | 6 | 21 | female | High School | 12951.0 | 0.0 | OWN | 2.0 | 2500.0 | VENTURE | 7.14 | 0.19 | 1 | No |
| 6 | 7 | 26 | female | Bachelor | 93471.0 | 1.0 | RENT | 3.0 | 35000.0 | EDUCATION | 12.42 | 0.37 | 1 | No |
| 7 | 8 | 24 | female | High School | 95550.0 | 5.0 | RENT | 4.0 | 35000.0 | MEDICAL | 11.11 | 0.37 | 1 | No |
| 8 | 9 | 24 | female | Associate | 100684.0 | 3.0 | RENT | 2.0 | 35000.0 | PERSONAL | 8.90 | 0.35 | 1 | No |
| 9 | 10 | 21 | female | High School | 12739.0 | 0.0 | OWN | 3.0 | 1600.0 | VENTURE | 14.74 | 0.13 | 1 | No |
| | person_id | person_age | person_gender | person_education | person_income | person_emp_exp | person_home_ownership | cb_person_cred_hist_length | loan_amnt | loan_intent | loan_int_rate | loan_percent_income | loan_status | previous_loan_defaults_on_file |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 179990 | 179991 | 31 | male | Master | 136832.0 | 9.0 | RENT | 7.0 | 12319.0 | PERSONAL | 16.92 | 0.09 | 1 | No |
| 179991 | 179992 | 24 | male | High School | 37786.0 | 0.0 | MORTGAGE | 4.0 | 13500.0 | EDUCATION | 13.43 | 0.36 | 1 | No |
| 179992 | 179993 | 23 | female | Bachelor | 40925.0 | 0.0 | RENT | 4.0 | 9000.0 | PERSONAL | 11.01 | 0.22 | 1 | No |
| 179993 | 179994 | 27 | female | High School | 35512.0 | 4.0 | RENT | 5.0 | 5000.0 | PERSONAL | 15.83 | 0.14 | 1 | No |
| 179994 | 179995 | 24 | female | Associate | 31924.0 | 2.0 | RENT | 4.0 | 12229.0 | MEDICAL | 10.70 | 0.38 | 1 | No |
| 179995 | 179996 | 27 | male | Associate | 47971.0 | 6.0 | RENT | 3.0 | 15000.0 | MEDICAL | 15.66 | 0.31 | 1 | No |
| 179996 | 179997 | 37 | female | Associate | 65800.0 | 17.0 | RENT | 11.0 | 9000.0 | HOMEIMPROVEMENT | 14.07 | 0.14 | 1 | No |
| 179997 | 179998 | 33 | male | Associate | 56942.0 | 7.0 | RENT | 10.0 | 2771.0 | DEBTCONSOLIDATION | 10.02 | 0.05 | 1 | No |
| 179998 | 179999 | 29 | male | Bachelor | 33164.0 | 4.0 | RENT | 6.0 | 12000.0 | EDUCATION | 13.23 | 0.36 | 1 | No |
| 179999 | 180000 | 24 | male | High School | 51609.0 | 1.0 | RENT | 3.0 | 6665.0 | DEBTCONSOLIDATION | 17.05 | 0.13 | 1 | No |
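The sample rows also hint at why loan_amnt and loan_percent_income are flagged as highly correlated: in the first ten rows, loan_percent_income looks like loan_amnt divided by person_income, rounded to two decimals. This is an observation from the sample, not something the report states. A minimal sketch that checks it, with values copied from the first-rows table and an illustrative helper name:

```python
# (loan_amnt, person_income, loan_percent_income) from the first ten sample rows.
rows = [
    (35000.0, 71948.0, 0.49),
    (1000.0, 12282.0, 0.08),
    (5500.0, 12438.0, 0.44),
    (35000.0, 79753.0, 0.44),
    (35000.0, 66135.0, 0.53),
    (2500.0, 12951.0, 0.19),
    (35000.0, 93471.0, 0.37),
    (35000.0, 95550.0, 0.37),
    (35000.0, 100684.0, 0.35),
    (1600.0, 12739.0, 0.13),
]


def is_consistent(loan_amnt, person_income, reported, tolerance=0.005):
    """True if the reported ratio equals loan/income up to two-decimal rounding."""
    return abs(loan_amnt / person_income - reported) <= tolerance + 1e-12


mismatches = [row for row in rows if not is_consistent(*row)]
print("rows checked:", len(rows), "mismatches:", len(mismatches))
```

If the relationship holds across the full dataset, loan_percent_income is a derived feature, and keeping it alongside both loan_amnt and person_income adds redundancy.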